Frontiers in Computational Neuroscience — Latest Matching Preprints

1

Parameter-efficient deep learning for pneumonia detection on chest X-rays: A comparative evaluation of explainable AI methods

Mahtabi, B.; Nasr-Esfahani, E.; Yaraghi, S.

2026-07-16 radiology and imaging 10.64898/2026.07.14.26358065 medRxiv

Top 1.0%

1.1%

Show abstract

Pneumonia is a leading cause of infectious disease mortality worldwide, accounting for approximately 2.5 million deaths annually and 15% of deaths in children under five. Chest X-ray imaging remains the primary diagnostic tool, but accurate interpretation requires radiological expertise that is disproportionately concentrated in high-income settings, creating a diagnostic gap where disease burden is highest. Automated deep learning offers a scalable complement to specialist-dependent diagnosis, yet clinical adoption requires both high accuracy and transparent, interpretable reasoning. Convolutional neural networks (CNNs) have shown strong potential for pneumonia detection from chest X-rays, but two barriers impede clinical translation: the interpretability of black-box models and the computational feasibility of large architectures in resource-constrained settings. Explainable AI (XAI) methods such as Grad-CAM, Grad-CAM++, and Score-CAM address the interpretability barrier, yet systematic quantitative comparisons across multiple CNN architectures remain scarce. Furthermore, CNN architectures widely used for medical image classification carry high parameter counts that limit feasibility in resource-constrained settings, motivating architectures that achieve competitive accuracy with substantially fewer parameters. Here we propose a parameter-efficient deep learning framework for pneumonia detection based on transfer learning, evaluated across three CNN architectures representing distinct architectural families: EfficientNet-B0 with fine-tuning (proposed method), ResNet50, and DenseNet121, trained under identical conditions on the Kaggle chest X-ray dataset (5,863 images). Our method achieved 90% classification accuracy, outperforming both baselines while requiring 4.8x fewer parameters than ResNet50. To evaluate explainability, Grad-CAM, Grad-CAM++, and Score-CAM were applied across all three architectures and compared quantitatively using Intersection over Union against manually annotated lung segmentation masks, Insertion score, and Deletion score, with pairwise statistical validation via Wilcoxon signed-rank tests and Bonferroni correction. Findings show that classification accuracy and XAI explanation quality must be evaluated independently, and that the proposed parameter-efficient architecture offers a favorable trade-off for resource-constrained clinical deployment.

2

Statistical Inference and Power Analysis for Comparative F1 and Fβ Scores under Correlated Classifier Pairs

Hsu, C.-Y.; Liu, Q.; Shyr, Y.

2026-07-17 dermatology 10.64898/2026.07.15.26358166 medRxiv

Top 2%

0.5%

Show abstract

As machine learning and artificial intelligence systems are increasingly used in healthcare, rigorous evaluation of their classification performance has become critical. The F1 and F{beta} scores are widely adopted metrics for assessing performance in imbalanced biomedical data. Recently, we introduced psF1, a unified statistical framework for inference and study design for single and comparative F1 and F{beta} scores under the assumption of independent classifiers. In practice, however, benchmarking two classifiers on the same dataset creates a correlated paired setting. Ignoring this intrinsic dependency leads to overestimation of the standard error and a substantial loss of statistical power. To address this, we develop psF1pair, an advanced framework for statistical inference and power analysis that explicitly accounts for correlations between classifier pairs. Extensive simulation studies demonstrate the performance of psF1pair, and its utility is further illustrated through application to a real-world imaging classification system. As expected, higher correlation between classifiers yields narrower confidence intervals and enhanced statistical power. A freely available R package is provided to facilitate implementation, supporting accurate evaluation and study design for predictive and classification models in biomedical research.

3

Accelerated MCDW-pCASL Using Subspace Low-Rank Reconstruction for Quantification of BBB Water Exchange and Permeability

Liu, Z.; Zhao, C.; Huang, Z.; Guo, F.; Wang, D. J.; Shao, X.

2026-07-16 radiology and imaging 10.64898/2026.07.13.26357046 medRxiv

Top 2%

0.3%

Show abstract

Purpose: To develop an accelerated motion-compensated diffusion-weighted pseudo-continuous arterial spin labeling (MCDW-pCASL) method using a spatial subspace low-rank reconstruction method for efficient quantification of blood-brain barrier (BBB) water exchange (kw) and permeability (PSw). Methods: An accelerated multidelay MCDW-pCASL sequence was developed to simultaneously encode intravascular and extravascular diffusion-weighted ASL signals across multiple post-labeling delays (PLDs). A spatial subspace low-rank reconstruction framework was optimized to enable joint estimation of cerebral blood flow (CBF) and BBB water exchange rate and permeability. Fourteen young healthy adults underwent test-retest scans (separated by ~1 week) at 3T with both the accelerated MCDW-pCASL and a conventional diffusion-prepared (DP) pCASL sequence. Whole-brain, gray-matter, and white-matter CBF and kw values were quantified to assess test-retest repeatability and cross-method agreement. An additional cohort of 30 older adults underwent single-session MCDW and DP scans to evaluate age-related perfusion and BBB kw/PSw differences. Intraclass correlation coefficients (ICCs) were used to assess reliability and agreement. Results: Accelerated MCDW-pCASL demonstrated excellent agreement with DP-pCASL for CBF (ICC = 0.89) and fair agreement for kw (ICC = 0.56). Test-retest repeatability of MCDW-pCASL was good for CBF, BBB kw and PSw (ICC {approx} 0.6). Across both sequences, younger subjects exhibited significantly higher CBF and kw compared with older adults. Conclusion: Incorporating a spatial low-rank subspace reconstruction enables accelerated MCDW-pCASL acquisition with reliable simultaneous quantification of CBF, BBB kw and PSw. Clinical applications of this method for assessing perfusion and BBB function are warranted.

4

Across-Site MRI Prediction of Substantial Lymphovascular Space Invasion in Endometrial Cancer: Radiomics versus Deep Learning Features

Di Giovanni, D. A.; Tanaka, A.; Horikoshi, T.; Tsuboyama, T.; Yokota, H.; Zakarian, R.; Matsumoto, Y.; Vallieres, M.; Reinhold, C.

2026-07-16 radiology and imaging 10.64898/2026.07.14.26358100 medRxiv

Top 3%

0.3%

Show abstract

Purpose: To compare the cross-site generalization of radiomic features and deep learning embeddings for MRI prediction of substantial lymphovascular space invasion (LVSI) in endometrial cancer. Materials and Methods: This retrospective two-center study included 206 women (mean age, 59.8 years) with endometrial cancer who underwent preoperative 3-T MRI from March 2016 to March 2023. Hospital A (n = 130) was used for development and Hospital B (n = 76) for strict external testing. T2-weighted, reduced field-of-view diffusion-weighted, and apparent diffusion coefficient images were manually segmented. Radiomic features and seed-pooled embeddings from 3D ResNet18, DenseNet121, and U-NEXtractor were modeled with elastic-net logistic regression or XGBoost. Out-of-fold Platt calibration and sensitivity-targeted thresholds were estimated using development data only. AUCs were summarized with 95% bootstrap confidence intervals. Results: External radiomics with elastic-net achieved an AUC of 0.609 (95% CI: 0.464, 0.740) and sensitivity of 0 of 12 (0%). DenseNet121 with elastic-net had the highest external AUC (0.685; 95% CI: 0.538, 0.822) but sensitivity of 3 of 12 (25%). U-NEXtractor with elastic-net detected 10 of 12 positive cases (83.3%) with specificity of 32 of 64 (50.0%) and balanced accuracy of 0.667. XGBoost showed higher apparent development performance but weaker external operating behavior. Conclusion: Under real-world cross-site MRI acquisition shift, DenseNet121 and U-NEXtractor embeddings showed better external generalization than handcrafted radiomic features for substantial LVSI prediction.

5

Dual-Filament 3D Printing of Patient-Specific CT Phantoms with Embedded Implants and Tunable Metal-Artifact Intensity

Pasyar, P.; Mei, K.; Im, J. Y.; Roshkovan, L.; Geagan, M.; Noël, P. B.

2026-07-20 radiology and imaging 10.64898/2026.07.17.26358319 medRxiv

Top 3%

0.2%

Show abstract

ABSTRACT Background: Metallic implants such as orthopedic screws, prostheses, and dental hardware produce beam-hardening, photon-starvation, and streak artifacts that degrade computed tomography (CT) image quality, and the metal artifact reduction (MAR) methods developed to mitigate them require objective, reproducible benchmarking. Purpose: Objective evaluation of MAR algorithms in CT is hindered by the absence of phantoms that simultaneously provide anatomically realistic backgrounds, embedded implants of known geometry, and controllable, ground-truth--referenced artifact intensity. We present a dual-filament, voxel-level three-dimensional (3D) printing method that fulfills these requirements and demonstrate its capabilities on a clinically representative cervical spine case with embedded orthopedic spinal screws. Methods: The proposed method extends the PixelPrint framework, a fused-deposition-modeling (FDM) workflow that converts clinical Digital Imaging and Communications in Medicine (DICOM) data directly into 3D-printer Geometric code (G-code) without intermediate segmentation or surface meshing, to interleaved, voxel-level deposition of two filaments: a calcium-doped polylactic acid (PLA) for soft tissue and bone, and a higher-attenuation metal-doped PLA for metallic implants. For demonstration, anonymized DICOM data of a healthy cervical spine were used to design and fabricate three matched phantoms, each with six embedded spinal screws at C4--C6: a 0% metal-infill ground-truth phantom, a 50% medium-metal-infill phantom, and an 85% high-metal-infill phantom. All phantoms were scanned on a clinical spectral CT system at 120 kVp and 1000 mAs, reconstructed at 0.67 mm slice thickness with virtual monoenergetic imaging (VMI) across 50--190 keV. Method performance was characterized by region of interest (ROI)-based Hounsfield Unit (HU) agreement with the source patient data and by the noise-independent Gumbel-distribution p-index metric. Results: The dual-filament method reproduced patient anatomy, soft-tissue contrast, and screw geometry with high fidelity. ROI HU values agreed with patient data within {+/-}25 HU for soft tissue and trabecular bone; cortical regions were underestimated owing to the current ceiling of the calcium-doped PLA used in this study. The tunable-artifact behavior was quantified as follows: the Gumbel location parameter scaled monotonically from 46.7 HU (no-metal background) to 57.1 HU (50% infill) to 90.5 HU (85% infill) for the VMI 70 keV with standard filter. High-keV VMI reconstructions substantially reduced streak and beam-hardening artifacts while preserving anatomic detail. Conclusions: The proposed dual-filament, voxel-level PixelPrint method enables the fabrication of patient-specific, multi-material CT phantoms with embedded metallic implants and controllable, ground-truth--referenced artifact intensity. Although demonstrated here in a single cervical-spine case, the workflow is anatomy- and implant-agnostic by construction and could in principle be adapted to other musculoskeletal sites (e.g., knee, hip, dental) and implant materials, providing a reproducible methodological foundation for benchmarking MAR algorithms, characterizing spectral CT performance, and validating emerging photon-counting detector systems. Keywords: 3D printing methodology; fused deposition modeling; voxel-level multi-material printing; spectral computed tomography; metal artifact reduction; phantom design; orthopedic implants; dual filament; PixelPrint.

6

Identification of Persistent Radiomics Feature Co-occurrence Across Diverse Tissue Types and Individuals: A Network-Based Analysis of the RADAPT CT Atlas

Amiri, S.; Afshar, P.; Rohban, M. H.

2026-07-19 radiology and imaging 10.64898/2026.07.17.26358252 medRxiv

Top 4%

0.1%

Show abstract

Objectives. Radiomics pipelines extract hundreds of quantitative features that are widely known to be redundant, but the structure of this redundancy is usually treated as a per-dataset nuisance to be pruned away. We tested the alternative hypothesis that a substantial number of feature-feature correlations are universal: they persist across patients and across anatomically distinct structures because they reflect shared mathematical and image-statistical properties of how the image is summarised, rather than properties of the tissue being imaged. Materials and Methods. We re-analysed the publicly available Radiomics Atlas Dataset of normal Abdominal and Pelvic CT (RADAPT), restricting the analysis to the 526 non-contrast-enhanced examinations of the 531-subject atlas and to the 107 original (non-filtered) PyRadiomics features. The 53 segmented structures were grouped into four broad anatomical categories -- bones, muscles, vessels, and parenchymal organs. RADAPT is distributed as one Excel file per structure, with patients as rows and features as columns. Within each structure file we z-score-normalised every feature across patients, computed the absolute Spearman correlation matrix, and retained edges with |{rho}| [≥] {tau} for {tau} in {0.70, 0.80, 0.90}. We then intersected the edge sets across all structure files to obtain a "universal" correlation graph, in which an edge survives only if it exceeds the threshold in every structure (each estimated across the full patient sample). Stable feature communities were defined as the maximal cliques of this graph. Robustness to patient sampling was tested by repeating the entire pipeline on five independent random splits of each file into two patient halves (10 sub-cohorts per threshold), and the implementation was independently reproduced in R. Results. Despite the strictness of the global-intersection criterion, 34, 24, and 14 stable feature communities survived at {tau} = 0.70, 0.80, and 0.90 respectively, with the largest cliques containing six members at {tau} = 0.70 and {tau} = 0.80 and five members at {tau} = 0.90. The community structure was clearly interpretable: separate cliques captured (i) variance-like intensity dispersion, (ii) long-run / low-frequency (coarse) texture, (iii) high gray-level texture, (iv) low gray-level texture, (v) volume and surface shape, and (vi) local-homogeneity and energy/entropy duals. On random-half resampling the exact-match recovery rate of these communities was 81.5 %, 86.7 %, and 80.7 % across the three thresholds; departures from exact recovery were almost always a single boundary feature added or dropped, consistent with finite-sample fluctuation of near-threshold edges rather than structural instability. The R re-implementation reproduced the Python results exactly. Conclusion. A substantial portion of radiomics feature collinearity is universal across patients and tissues. We distinguish two layers within it: trivial near-algebraic duals that are universal by construction, and non-trivial cross-matrix-family communities that are the genuine empirical finding. Together they provide an interpretable, definition-grounded basis for aggressive dimensionality reduction, for retrospectively reconciling apparently different feature selections in the literature, and for moving radiomics pipelines toward organ-agnostic, more reproducible models. Clinical relevance statement. Selecting a single representative feature from each universal community shrinks the original-feature space by roughly an order of magnitude without sacrificing biologically distinct information. For example, the five variance-family members (first-order Variance, GLCM SumSquares, GLCM ClusterTendency, GLDM and GLRLM GrayLevelVariance) can be replaced by a single representative, removing redundant degrees of freedom that would otherwise inflate model variance; and labelling each retained feature by its community lets two studies that selected different variance-family names be recognised as having found the same signal, simplifying model development and improving cross-cohort generalisability in clinical CT workflows.

7

Validating Artificial Intelligence Guidance for Ultrasound Acquisition and Remote Interpretation

Maldonado, T.; Muluk, S.; Rali, P.; Soni, N.; Nathanson, R.; Kuttab, H.; VandeHei, M.; Michels, C.; Swietlik, J.; Speranza, G.; Schaffer, O.; Collaborating Investigators Group, ; Al Noor, F.; Mischkewitz, S.; Kainz, B.; Blaivas, M.; Jacobowitz, G.

2026-07-19 radiology and imaging 10.64898/2026.07.16.26356882 medRxiv

Top 5%

0.1%

Show abstract

Background: Venous thromboembolism (VTE), including deep vein thrombosis (DVT), remains a major global health burden. Diagnostic pathways rely on ultrasound but are limited by availability and prolonged time-to-imaging. Novel artificial intelligence (AI) guidance systems have been designed to enable non-ultrasound-trained operators to acquire proximal lower extremity compression ultrasounds for remote clinician interpretation. Methods: This multicenter, double-blinded, prospective, nonrandomized study evaluated the performance of an AI guidance system (ThinkSono Guidance, ThinkSono, GmbH). Patients underwent AI-guided ultrasound(s) and standard of care ultrasound(s). Primary and secondary endpoints were image quality, sensitivity and specificity for proximal DVT, and prioritization specificity, a measure of specificity in identifying patients requiring standard of care ultrasound after AI-guided scan. Results: Of 634 recruited subjects, 594 were analyzed, with 67 DVTs across 700 scans. 86.83% of AI-guided scans achieved diagnostic image quality. Triage sensitivity was 92.86%, triage specificity 39.12%, prioritization specificity 97.96%. Standard of care ultrasounds could be avoided in 35.32% of patients. Total median AI-guided scan and review time was 7.57 minutes. Conclusions: Clinician-reviewed AI-guided scans were rapid, sensitive for DVT, and specific for prioritizing patients requiring standard of care ultrasounds. These findings suggest AI-guided ultrasound may be a scalable triage strategy to expand DVT evaluation access, particularly in resource-constrained and after-hours settings

8

Diagnostic Accuracy of MRI Radiomics for Predicting KRAS Mutation in Rectal Cancer: A Systematic Review and Meta-analysis

Saleh, M. M.; Hegazy, M.; Alsaied, M. A.; Elkenani, A. J.; Ehab, R.; Hesham, M.; Abdelrazek, H. M.; Nazemi, S.; Shalaby, M.; El-Hussuna, A.

2026-07-20 radiology and imaging 10.64898/2026.07.17.26358357 medRxiv

Top 7%

0.1%

Show abstract

Background: KRAS mutation status is an important biomarker in rectal cancer, with implications for prognosis and treatment response. MRI-based radiomics has emerged as a non-invasive approach for predicting tumor genotypes. However, the diagnostic performance of MRI radiomics for predicting KRAS mutation status remains unclear. This study aimed to evaluate the diagnostic accuracy of MRI radiomics for predicting KRAS mutations in rectal cancer. Methods: A systematic search of PubMed, Cochrane Library, Scopus, and Web of Science was performed through July 2025. Diagnostic test accuracy studies evaluating MRI-based radiomics or artificial intelligence models for predicting KRAS mutation status in adult patients with rectal cancer were included, using molecular testing as the reference standard. Risk of bias was assessed using the QUADAS-2 tool. Pooled sensitivity and specificity were estimated using a bivariate random-effects model. Results: Seven studies involving 1,224 patients were included. The pooled sensitivity was 0.736 (95% CI: 0.697-0.772) and the pooled specificity was 0.645 (95% CI: 0.586-0.701). The false positive rate was 0.355 (95% CI: 0.299-0.414). The area under the hierarchical summary receiver operating characteristic curve was 0.754, with a normalized partial AUC of 0.666. Between-study heterogeneity ranged from low to moderate depending on the estimation method (I2 = 8.4%-53.3%). Conclusion: MRI radiomics demonstrates moderate diagnostic accuracy for predicting KRAS mutation status in rectal cancer and may serve as a promising non-invasive biomarker for preoperative molecular stratification. Further large-scale studies with external validation are required to confirm its clinical utility.

9

Rationale and guidance for implementing the continual reassessment method for dose-finding in controlled human infection model studies

Weerasinghe, C.; Osowicki, J.; Simpson, J. A.; Crocker-Buque, T.; McCarthy, J.; Williams, E.; Price, D. J.

2026-07-17 infectious diseases 10.64898/2026.07.16.26358128 medRxiv

Top 7%

0.0%

Show abstract

Controlled human infection models (CHIMs) are increasingly used in infectious disease research to study pathogen dynamics and evaluate interventions under controlled conditions. However, these studies are resource-intensive and involve ethical and safety constraints, making efficient study design critical. Dose-finding is a key early component in CHIMs, where the aim is to identify a challenge dose that achieves a target infection probability. Traditional rule-based designs are commonly used but can be inefficient, motivating the use of model-based adaptive approaches such as the Bayesian Continual Reassessment Method (CRM). Although CRM has been extensively studied and widely adopted in Phase I oncology trials for identifying the maximum tolerated dose of therapeutics, its application in CHIM settings remains limited, particularly when the endpoint of interest is infection. This tutorial provides step-by-step guidance for implementing a Bayesian CRM in dose-finding CHIMs, using an oropharyngeal Neisseria gonorrhoeae challenge as a motivating case study. The framework outlines key design components, including dose-grid specification, dose-response model, prior elicitation, Bayesian updating, decision rules, and stopping criteria, with particular emphasis on a clinically interpretable parameterisation. Trial operating characteristics are evaluated through simulation studies under multiple dose-response scenarios and prior-predictive analyses, and compared with a commonly used '3+3' type rule-based design. This work highlights the advantages of Bayesian model-based designs for dose-finding in CHIMs over classic rule-based designs and provides a structured, reproducible framework for implementing CRM, supporting their application in future CHIM studies.

10

Comparative Efficacy of Vancomycin and Fidaxomicin Regimens for the Prevention of Recurrent Clostridioides difficile Infection: A Systematic Review and Network Meta-Analysis of Randomized Controlled Trials

Prosty, C.; Butler-Laporte, G.; Brophy, J.; Frenette, C.; Loo, V.; Coburn, B.; Hota, S.; Longtin, Y.; Kong, L.; Muller, M.; Steiner, T.; Valiquette, L.; Daneman, N.; Daley, P.; Nott, C.; MacFadden, D. R.; Kandel, C.; Chen, Y.; Perez- Patrigeon, S.; Lee, T. C.; McDonald, E.

2026-07-17 infectious diseases 10.64898/2026.07.14.26358112 medRxiv

Top 7%

0.0%

Show abstract

Background and Aims The optimal treatment for first episodes and first recurrences of Clostridioides difficile infections (CDI) is unknown and there is emerging evidence for pulse and taper (P-T) regimens. Therefore, we sought to estimate the relative efficacy of treatment options. Methods MEDLINE and CENTRAL were searched from database inception to May 21, 2025 and unpublished conference abstracts were searched from recent infectious disease conferences. RCTs on the treatment of first episodes or first recurrences of CDI comparing fixed-dose or P-T regimens of fidaxomicin or vancomycin were included. The primary and secondary outcomes were 40- and 56-day CDI recurrence, respectively. A random-effects network meta-analysis on the risk ratio (RR) scale was conducted using a standard regimen (10-14 days) of vancomycin as the comparator. Treatments were ranked using the surface under the cumulative ranking curve (SUCRA). Results 8 RCTs were included comprising a total of 2181 patients. For 40-day recurrence, fidaxomicin P-T had the highest probability of ranking best (RR=0.10, 95%Confidence Interval [95%CI]=0.10-0.49, SUCRA=1.00), followed by vancomycin P-T (RR=0.49, 95%CI=0.32-0.76, SUCRA=0.61), fixed-dose fidaxomicin (RR=0.61, 95%CI=0.49-0.76, SUCRA=0.39), and, finally, fixed-dose of vancomycin (SUCRA=0.00). The treatments ranked in the same order for 56-day recurrence, though only 3 RCTs reported on this timepoint. Conclusion Vancomycin P-T, fidaxomicin P-T, and fixed-dose fidaxomicin were all superior to a fixed-dose vancomycin. Head-to-head comparative effectiveness RCTs are needed to quantify their relative effect sizes of and impact on long-term prevention of recurrent CDI.

11

Nationwide Mpox Genomic Surveillance Reveals Clade Ib Introductions, APOBEC3-Driven Evolution, and Terminal Deletions

Brochu, H. N.; Shi, Q.; Song, K.; Zhang, Q.; Munroe, J.; Harris, N. J.; Britt, N.; Zeng, Q.; Kapuria, K.; Chappell, J.; Norvell, B. M.; Peavy, L.; Williams, J. D.; Harris, A. B.; Chaitram, J.; Hutson, C. L.; Deng, J.; McGrath, D.; Boles, D.; Dale, S. E.; Gigante, C. M.; Iyer, L. K.

2026-07-17 infectious diseases 10.64898/2026.07.15.26357894 medRxiv

Top 7%

0.0%

Show abstract

Background The 2022-2023 global mpox outbreak highlighted the critical need for robust genomic surveillance capabilities to track mpox virus (MPXV) evolution and transmission dynamics. Methods Building upon our established SARS-CoV-2 sequencing infrastructure, we implemented a Molecular Loop probe-based long-read sequencing approach using Pacific Biosciences Sequel II technology for comprehensive MPXV genomic surveillance across the United States (US). From August 2024 to June 2025, we generated 326 high-quality whole genome sequences from residual mpox-positive clinical specimens collected by Labcorp across all 10 US Department of Health and Human Services regions. Results Our analysis identified two samples containing clade Ib MPXV in January and June 2025 and captured shifting trends in clade IIb diversity, with 13 distinct lineages observed. We also identified multiple instances of large (~1.6-17.6kb) deletions proximal to the inverted terminal repeats in clade IIb genomes. APOBEC3 mutation analysis indicated substantial evidence of human-to-human transmission among both clades. Further, we observed significantly higher APOBEC3-associated SNPs per kilobase (P<0.001) in clade IIb genomic variable regions relative to their central conserved region. Our assay exhibited strong reproducibility across biological replicates from individual patients and accuracy was confirmed via parallel sequencing of select specimens by US Centers for Disease Control and Prevention (CDC) using metagenomic sequencing. We also demonstrated via custom simulation that our assay discriminates all known MPXV clades and lineages, including those we have not observed in the US. Conclusions Our integrated nationwide surveillance system facilitates real-time genomic tracking of outbreak evolution, with demonstrated capacity across SARS-CoV-2 and MPXV, positioning this platform for rapid deployment during future pathogen emergence.

12

Complex intra-host SARS-CoV-2 evolution following monoclonal antibody pre-exposure prophylaxis

Kamelian, K.; Pascall, D. J.; Cheng, M. T. K.; Meng, B.; Altaf, M.; Morse, R. M.; Aggio, J. B.; Egan, D. J. S.; Chen-Xu, M.; Trivioli, G.; Sutton, B.; Richter, A.; Gonzalez-Vazquez, L. D.; Cormie, C.; Kemp, S.; Yeadon, R.; Hyatt, B.; Wong, A.; Thesin Pelamkulangara, N.; Fraser, E.; McCarthy, B.; Novaes, F.; Stott, S.; Galvin, A.; Bellis, K. L.; De Angelis, D.; Harrison, E. M.; Martin, D.; Smith, R. M.; Gupta, R. K.

2026-07-17 infectious diseases 10.64898/2026.07.14.26356329 medRxiv

Top 7%

0.0%

Show abstract

Background: Monoclonal antibodies have emerged as a prophylactic strategy to prevent symptomatic SARS-CoV-2 infection in immunocompromised individuals. However, the evolutionary and clinical implications of breakthrough infections under this regime remain unclear. Methods: A male in their 80s with a haematological/oncological diagnosis received a 2000 mg intravenous infusion of sotrovimab in March 2023 and was diagnosed with COVID-19 by RT-qPCR from a nasopharyngeal swab in August 2023. Weekly samples (n=24) were collected through February 2024 (171 days). All samples underwent whole-genome sequencing, with select mutations subjected to functional assessment. Findings: Sequencing identified the GE.1 lineage at all timepoints. An intra-host recombination event in ORF1ab (positions 8942-12458) was detected prior to 23 weeks post-detection, followed by a 14-fold increase in viral load (7.42e+06 to 1.00e+08 RNA copies/mL) and a marked shift in the viral population. E340D, a sotrovimab resistance mutation, was detected at low abundance (46%) within the first week post-infection, fluctuated over time, and was nearly fixed by week 15 (107 days) post-detection. We assessed five spike mutations - V36M, S98F, and V213G in the N-terminal domain, Y505P in the receptor-binding domain, and P681Q near the S1/S2 cleavage site - and additionally evaluated the impact of E340D. V36M conferred the highest infectivity across all cell lines, with the most significant effect in low-TMPRSS2 cells. While all mutations showed enhanced infectivity with the addition of E340D, the effect was most pronounced in mutations with lower baseline infectivity. The addition of E340D significantly decreased relative neutralizing titres for V36M, S98F, and V213G, enabling escape from neutralizing antibodies in XBB-responsive individuals, illustrating an enhanced phenotypic advantage. Patient neutralizing activity was absent pre-sotrovimab, and sotrovimab-induced neutralization was further compromised by selection of E340D. Interpretation: Sotrovimab pre-exposure prophylaxis in an immunocompromised patient did not prevent SARS-CoV-2 infection, and selected for resistant mutation E340D, with unexpected fitness consequences across non-receptor binding domain spike regions.

13

FootNet: A Multi-View Smartphone Dataset and Four-Model Benchmark for Clinical Foot Segmentation

Vijay, A.; Prabhune, A.; Srihari, V. R.; Rayampalli, A.

2026-07-17 health informatics 10.64898/2026.07.15.26358117 medRxiv

Top 7%

0.0%

Show abstract

We present FootNet, a 453-image multi-view smartphone foot dataset for binary foot segmentation, with expertannotated masks across six anatomical views (dorsal, medial, and plantar, both left and right). We benchmark four segmentation models under a controlled protocol: U-Net with a MobileNetV2 encoder achieves the best performance (IoU 0.9268, Dice 0.9608, 95 % CI [0.9209, 0.9320]); DeepLabV3 with MobileNetV3-Large scores IoU 0.8984 (Dice 0.9449); UNet++ with MobileNetV2 scores IoU 0.8913 (Dice 0.9391); and SAM ViT-B with oracle boundingbox prompt scores IoU 0.9219 on the matched 191-image subset. Bonferroni-corrected Wilcoxon signed-rank tests (k = 6 comparisons) show U-Net significantly outperforms DeepLab (p < 0.001, r = 0.638) and SAM ViT-B with oracle boundingbox (p = 0.005, r = 0.202); UNet++ does not significantly differ from DeepLab (p = 0.062). Connected-component postprocessing yields negligible benefit (mean {triangleup}IoU = +0.0003, 12 of 453 images improved). The extended dataset is available upon request

14

Genome-Wide Association Studies and Deep-Learning Functional Annotation of Opioid Use Disorder across Three Ancestries in the All of Us Research Program

Gu, S.; Petrovitch, D.; Hall, O. T.; Lambert, J. W.; Kember, R. L.; Nahid, N. A.; Ma, Q.; Sprague, J. E.; McDonough, C. W.; Johnson, J. A.

2026-07-17 addiction medicine 10.64898/2026.07.15.26358096 medRxiv

Top 7%

0.0%

Show abstract

Background: Opioid use disorder (OUD) is heritable, yet most genome-wide association studies (GWAS) have focused on European populations, leaving the genetic architecture of OUD in non-European populations underexplored. Methods: We conducted GWAS of OUD across three ancestries using electronic health records and genomic data from 52,357 All of Us Research Program participants (8,912 cases; 43,445 matched opioid-exposed controls; 48.5% female). Participants were stratified into European (EUR), African (AFR), and Admixed American (AMR) ancestry groups for logistic regression GWAS, with independent replication in the Million Veteran Program. We then applied the deep-learning model AlphaGenome to predict the tissue-specific transcriptomic and splicing consequences of top risk variants across 13 reward-pathway brain regions. Results: We identified and replicated a novel DDX6 risk locus, alongside established OPRM1 and FURIN signals. AlphaGenome predicted the DDX6 regulatory allele downregulates the stress-resistance gene FOXR1 in the nucleus accumbens, while the protective OPRM1 variant (rs1799971) upregulates OPRM1 expression across reward networks. Other signals of interest included IL6R and SHISA9 (EUR); GHR (AFR); and ASTN2 (AMR). Conclusions: This study identifies DDX6 as a novel OUD risk locus, replicates associations with OPRM1 and FURIN, and highlights biologically plausible ancestry-specific signals in AFR and AMR populations. We also replicated top variants in an independent population. Finally, integrating GWAS with deep-learning annotations provides specific, localized biological hypotheses to guide future experimental validation and targeted therapeutics.

15

Efficient stochastic epidemic simulation via the Sellke construction

van Boven, M.; Bootsma, M. C.

2026-07-17 epidemiology 10.64898/2026.07.16.26358219 medRxiv

Top 7%

0.0%

Show abstract

Stochastic epidemic models are a cornerstone of infectious disease epidemiology and are often used to study intervention scenarios. However, large run-to-run variability can make intervention effects difficult to estimate precisely. We revisit the epidemic Sellke construction, which assigns each individual an infection threshold for the cumulative infection hazard such that, conditional on the thresholds, the epidemic trajectory becomes deterministic. This enables coupling of simulations with and without an intervention, yielding low-variance effect estimates even when outcomes such as final size or peak incidence vary widely between runs. We develop an exact, event-driven implementation that maintains infection and recovery events in priority queues. Cumulative infection-hazard updates require O(log N) time per event, yielding overall complexity O(Elog N) for E events in a population of size N. The implementation achieves computational performance comparable to the classical Gillespie algorithm while naturally accommodating non-Markovian infectious periods and complex infectiousness profiles. We illustrate the approach using distance-dependent spread of avian influenza between poultry farms in the Netherlands and a multilayer population with households, schools, and workplaces. In both examples, coupling enables efficient within-run comparisons of intervention scenarios across stochastic realisations.

16

Bridging surveillance gaps in dengue: a hierarchical model integrating mixed data sources for transmission estimation and vaccine targeting

Djaafara, B. A.; Elyazar, I. R.; Yosephine, P.; Surya, A.; Silalahi, F. S.; Handito, A.; Thohir, B.; Aryani, D.; Gunawan, D.; Nisa, A. K.; Prianto, E.; Samad, I.; Cook, A. R.; Huang, A. T.; Clapham, H. E.; Bhatt, S.; Mishra, S.

2026-07-17 epidemiology 10.64898/2026.07.15.26358208 medRxiv

Top 7%

0.0%

Show abstract

Estimating dengue force of infection (FOI) is essential for understanding transmission dynamics and targeting intervention programmes, yet surveillance data in endemic settings required for estimations are often incomplete, with varying formats. We developed a Bayesian hierarchical catalytic model that jointly fits age-stratified case data, aggregate case data, and seroprevalence surveys within a single framework, incorporating external covariates to improve parameter identifiability. Synthetic validation showed that covariates alone recovered accurate FOI point estimates even when most districts contributed only aggregate data, but did so with poorly calibrated uncertainty; anchoring the model with a single seroprevalence survey was necessary to bring credible interval coverage close to nominal. Applied to 128 districts across Java and Bali, Indonesia (2016-2024), the model revealed substantial spatial heterogeneity in FOI and reporting rates. Many districts in Java exceeded the WHO-suggested seroprevalence threshold for vaccine introduction, yet were classified as low-priority when using reported incidence as prioritisation criterion, particularly in areas with weak surveillance. Model-based seroprevalence estimation, integrating multiple data sources, offers a more consistent basis for identifying high-priority districts for vaccine introduction, and is less susceptible to surveillance bias than reported incidence.

17

Neonatal admission as a marker of risk for poor educational attainment and special educational needs in children aged 5-11 years

John, A.; Pike, C.; Olga, L.; Sovio, U.; Wong, H. S.; Smith, G. C.; Aiken, C.

2026-07-17 pediatrics 10.64898/2026.07.15.26358132 medRxiv

Top 7%

0.0%

Show abstract

Background: Children born prematurely (before 37 weeks) or admitted to the neonatal unit (NNU) are at increased risk of adverse long-term physical health outcomes. It is also recognised that there is an association with later academic performance and special educational needs, however it is not clear whether these broad risk factors could be used as stand-alone heuristics to identify children who may benefit from additional support in educational settings. We aimed to examine the associations between neonatal unit (NNU) admission and educational attainment in mid-childhood. Methods and Findings: Pregnancy data from a prospective birth cohort (Pregnancy Outcome Prediction Study, Cambridge, United Kingdom, 2008-2012) were linked to national educational outcomes (Department for Education, United Kingdom). Multivariable regression models adjusted for maternal, child, and socioeconomic factors were used to evaluate associations between (i) all NNU admissions, (ii) at term NNU admissions >48 hours, (iii) preterm birth without ongoing physical health needs, and educational outcomes at ages 5-11 years. Children who required any NNU care were more likely not to meet expected educational standards across multiple ages and domains in early and mid-childhood: age 5 early year foundation (aOR 1.64, 95% CI 1.19-2.27, p=0.003), phonics at age 6 (aOR 2.43, 95% CI 1.72-3.57, p<0.001), and at age 7 (here assessments were divided into multiple domains): reading (aOR 1.67, 95% CI 1.18-2.38, p=0.004), writing (aOR 1.72, 95% CI 1.25-2.38, p<0.001), mathematics (aOR 1.56, 95% CI 1.09-2.22, p=0.020), and science (aOR 1.85, 95% CI 1.22-2.78, p=0.003). Similar patterns were observed among both at term-born infants who stayed >48hrs in NNU (phonics assessment at age 6 aOR 2.26, 95% CI 1.51-3.36, p<0.001) and in children born preterm without long-term physical health sequelae (phonics assessment at age 6 aOR 3.07, 95% CI 1.96-4.81, p<0.001). These associations were robust to adjustment for demographic, perinatal, and socio-economic factors. By age 11, differences in academic attainment were attenuated and no longer clearly distinguishable across all exposure groups. However, there was an increased likelihood of special educational needs (SEN) at age 11 associated with any NNU admission (aOR 1.78, 95% CI 1.15-2.73, p=0.009), at term NNU admission for >48hrs (aOR 1.88, 95% CI 1.19-3.00, p=0.007), and children born preterm without long-term physical health sequelae (aOR 1.50, 95% CI 1.00-2.25, p=0.049). Predictive performance of any NNU admission for SEN at age 11 was moderate (AUC 0.70, 95% CI: 1.14-2.65, p=0.010), with balanced sensitivity and specificity and high negative predictive value. Conclusions: NNU admission, for both term and preterm infants, is associated with poorer educational outcomes and an increased likelihood of special educational needs in mid-childhood.

18

General Practice Perspectives on Post-Infection Conditions: Scoping Review and UK Survey

Aung, K. W.; Scuffell, J.; Podlasek, A.; Engamba, S.; Jones, F.; Edwards, A.; Chew-Graham, C. A.; Sanyaolu, L.; Busse-Morris, M.

2026-07-17 primary care research 10.64898/2026.07.15.26358157 medRxiv

Top 7%

0.0%

Show abstract

Background Post-infection conditions (PICs), such as Long Covid, are associated with heterogeneous, fluctuating symptoms that profoundly affect daily functioning. Despite moderate-certainty evidence from the NIHR-funded LISTEN trial (COV-LT2-0009) that personalised self management support improves outcomes and may reduce societal and economic impacts of Long Covid, many people living with PICs still receive condition-specific services, generic advice, or stand-alone digital tools that do not address their complex needs. Aim To map care approaches in general practice and synthesise UK evidence for PIC management. Design and setting Scoping review and online survey. Method A two-phase study was conducted: (1) a scoping review of UK evidence on PIC management in general practice; and (2) a supplementary online survey of practitioners working in UK general practice to provide contextual insights. Results The scoping review identified 32 studies focused on Long Covid. One study included a comparator group (ME/CFS). Study populations were predominantly white ethnicity and female. Evidence for non-Covid PICs in UK general practice was largely absent. The supplementary survey (n=46) provided preliminary practice-level insights. Healthcare practitioners reported varied PIC presentations, diagnostic uncertainty, limited referral pathways, inequitable access, and low confidence in managing PICs. Conclusion Evidence informing PIC management in UK general practice remains predominantly Long Covid-focused and may not reflect the range of PICs encountered in practice. While survey findings are preliminary and require confirmation in larger samples, they highlight uncertainty around PIC management. Further research is needed to evaluate whether existing Long Covid pathways should be expanded or complemented by broader PIC models. Keywords general practice; Long Covid; self-management; post-viral syndromes

19

Temporal relationships between distress and pain in people living with HIV

Arendse, G.; Kamerman, P.; Wadley, A.; Edwards, R. R.; Joska, J.; Parker, R.; Madden, V. J.

2026-07-17 primary care research 10.64898/2026.07.15.26358133 medRxiv

Top 7%

0.0%

Show abstract

Objective: There is a bidirectional relationship between emotional distress and pain. However, this relationship is understudied in people with HIV in low-resource settings. This study sought to describe the temporal relationship between emotional distress and pain in people with HIV. Design: Longitudinal observational study. Methods: Participants with virally suppressed HIV, reporting either no pain or persistent pain at baseline, provided weekly remote ratings of distress, worst pain, and average pain using 0-10 visual analogue scales. Within-individual fluctuations in distress and pain were visualised over time. Group-level correlations were determined using Spearman's correlation tests. Cumulative link mixed models assessed whether distress and pain each predicted the other in the following week. Results: 72 participants provided responses over 49 weeks. The participants had a median (IQR) age of 43 (37-51) years, 63% (n=45) were unemployed and most were females (n=51;71%). Distress and pain fluctuated concurrently within individuals: distress was positively correlated with worst pain ({rho}=0.66, 95% CI= 0.60-0.72, p<0.001) and average pain ({rho}=0.70, 95% CI=0.64-0.75, p<0.001) intensity within the same week. Worst pain (OR=1.42, 95% CI=1.17-1.71, p<0.001) and average pain (OR=1.43, 95% CI=1.20-1.71, p<0.001) intensity both predicted distress in the next week. Distress predicted worst pain intensity (OR=1.25, 95% CI=1.07-1.46, p=0.023) but not average pain intensity (OR=1.19, 95% CI=1.01-1.40, p=0.152) in the next week. Conclusions: The temporal relationship between distress and worst pain intensity was bidirectional, whereas distress did not temporally predict average pain intensity. Both pain and emotional distress should receive attention from HIV research and clinical care in low-resource settings.

20

Trends and variations in Lithium usage across care settings in England between 2015-2024

Schiffer, H.; Fisher, L.; Curtis, H. J.; Wood, C.; Brown, A. D.; Bacon, S. C.; Croker, R.; Goldacre, B.; MacKenna, B.; Speed, V.; Macdonald, O.

2026-07-17 psychiatry and clinical psychology 10.64898/2026.07.15.26357641 medRxiv

Top 7%

0.0%

Show abstract

Lithium has been the gold standard for the treatment and prevention of relapse in bipolar disorder for over 60 years. Guidance from the National Institute for Health and Clinical Excellence states explicitly to 'offer lithium as a first-line, long-term pharmacological treatment for bipolar disorder'. Yet, in the last two decades its use has been in decline with clinicians favouring anticonvulsants or antipsychotics when treating this condition. In this study, we have used three openly available datasets containing prescribing data from primary and secondary care to explore trends in the use of lithium in England, showing both regional and temporal variance between 2015-2024. We have shown that lithium use declined in primary care by 20.9% in the last ten years (2015-2024) and 10.9% overall in the last five years (2019 to 2025). We have also shown how there is some regional variation in the source of lithium for patients, although the vast majority is prescribed in primary care. Further research into clinical behaviour is needed to understand what is driving the decrease in lithium usage, and what barriers and enablers may influence its use across the country.